Identifying Coevolving Partners from Paralogous Gene Families

Author

  • Chen-Hsiang Yeang
Abstract

Many methods have been developed to detect coevolution from aligned sequences. However, all existing methods require a one-to-one mapping of candidate coevolving partners (nucleotides, amino acids) a priori. When two families of sequences have distinct duplication and loss histories, finding the one-to-one mapping of coevolving partners can be computationally involved. We propose an algorithm to identify the coevolving partners from two families of sequences with distinct phylogenetic trees. The algorithm maps each gene tree to a reference species tree and builds, for each species tree node, a joint state of sequence composition and assignments of coevolving partners. Applying dynamic programming to the joint states identifies the optimal assignments. Time complexity is quadratic in the size of the species tree, and space complexity is exponential in the maximum number of gene tree nodes mapped to the same species tree node. Analysis of both simulated data and Pfam protein domain sequences demonstrates that the paralog coevolution algorithm identifies the coevolving partners with 60% to 88% accuracy. This algorithm extends phylogeny-based coevolutionary models and makes them applicable to a wide range of problems, such as predicting protein-protein, protein-DNA and DNA-RNA interactions between two distinct families of sequences.
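
The abstract does not spell out how a gene tree is mapped to the species tree. As a rough illustration only, the sketch below uses the standard lowest-common-ancestor (LCA) reconciliation, which may differ from the mapping used in the paper. The toy trees and the names `SPECIES_PARENT`, `GENE_TREE`, `LEAF_SPECIES` and `map_gene_tree` are hypothetical. It also counts how many gene tree nodes land on each species tree node, the quantity that the abstract says drives the space complexity.

```python
from collections import Counter

# Hypothetical toy species tree, stored as a child -> parent map.
SPECIES_PARENT = {"A": "AB", "B": "AB", "AB": "ABC", "C": "ABC", "ABC": None}


def ancestors(node, parent):
    """Return the path from node up to the root, starting with node itself."""
    path = []
    while node is not None:
        path.append(node)
        node = parent[node]
    return path


def lca(x, y, parent):
    """Lowest common ancestor of x and y in the species tree."""
    seen = set(ancestors(x, parent))
    for node in ancestors(y, parent):
        if node in seen:
            return node
    raise ValueError("nodes are not in the same tree")


def map_gene_tree(tree, leaf_species, parent, mapping=None):
    """Map every gene tree node to a species tree node by LCA mapping.

    tree is either a leaf name (str) or a pair (left, right) of subtrees.
    Returns the species node of the root and a list of
    (gene_node, species_node) pairs for all gene tree nodes.
    """
    if mapping is None:
        mapping = []
    if isinstance(tree, str):
        species = leaf_species[tree]  # a leaf maps to its own species
    else:
        left, _ = map_gene_tree(tree[0], leaf_species, parent, mapping)
        right, _ = map_gene_tree(tree[1], leaf_species, parent, mapping)
        species = lca(left, right, parent)  # internal node: LCA of children
    mapping.append((tree, species))
    return species, mapping


# Hypothetical gene tree with a duplication inside species A.
GENE_TREE = (("a1", "a2"), ("b1", "c1"))
LEAF_SPECIES = {"a1": "A", "a2": "A", "b1": "B", "c1": "C"}

root_species, mapping = map_gene_tree(GENE_TREE, LEAF_SPECIES, SPECIES_PARENT)

# Number of gene tree nodes mapped to each species tree node.
load = Counter(species for _, species in mapping)
print(root_species, dict(load))
```

In this toy case the duplicated genes `a1` and `a2` and their parent all map to species `A`, so `A` carries the largest number of gene tree nodes. A joint state defined per species tree node, as in the abstract, would be largest there.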

Similar resources

The human protein coevolution network.

Coevolution maintains interactions between phenotypic traits through the process of reciprocal natural selection. Detecting molecular coevolution can expose functional interactions between molecules in the cell, generating insights into biological processes, pathways, and the networks of interactions important for cellular function. Prediction of interaction partners from different protein fami...

Coevolution of gene families in prokaryotes.

We study gene family coevolution on a tree of life based on a large-scale ancestral gene content reconstruction, which includes gene duplication and deletion events. The insights obtained from this study are threefold: (1) Global properties, such as the distribution of coevolution partners and the formation of disconnected clusters of coevolving families, can be an inevitable consequence of evo...

Correlated Evolution among Six Gene Families in Drosophila Revealed by Parallel Change of Gene Numbers

Proteins involved in a pathway are likely to evolve in a correlated fashion, and coevolving gene families tend to undergo complementary gains and losses. Accordingly, gene copy numbers (i.e., repertoire size) tend to show parallel changes during the evolution of coevolving gene families. To test and verify this hypothesis, here we describe positive correlations among the repertoire sizes of six...

Systematic identification of functional orthologs based on protein network comparison.

Annotating protein function across species is an important task that is often complicated by the presence of large paralogous gene families. Here, we report a novel strategy for identifying functionally related proteins that supplements sequence-based comparisons with information on conserved protein-protein interactions. First, the protein interaction networks of two species are aligned by ass...

Validation of the Partners' Checklist: Measuring the Problems Experienced by Family Members of Drug Abusers

Objective: The present study aimed to validate the partners' checklist among family members of drug abusers. Method: A descriptive research design was used. A total of 397 participants from families referred to addiction clinics in Semnan Province were randomly selected as the sample. Results: Reliability in the frequency range varied from .57 (healthy) to .81 (...

Journal:
  • Evolutionary Bioinformatics Online

Volume 4, Issue

Pages -

Publication date: 2008